Data Mining for the Discovery of Ocean Climate Indices
نویسندگان
چکیده
* This work was partially supported by NASA grant # NCC 2 1231 and by Army High Performance Computing Research Center cooperative agreement number DAAD19-01-2-0014. The content of this work does not necessarily reflect the position or policy of the government and no official endorsement should be inferred. Access to computing facilities was provided by the AHPCRC and the Minnesota Supercomputing Institute. ABSTRACT Ocean climate indices (OCIs), which are time series that summarize the behavior of selected areas of the Earth’s oceans, are important tools for predicting the effect of the oceans on land climate. In this paper we describe the use of data mining to discover Ocean Climate Indices (OCIs). In particular, we apply a shared nearest neighbor (SNN) clustering algorithm to cluster the pressure and temperature time series associated with points on the ocean, yielding clusters that represent ocean regions with relatively homogeneous behavior. The centroids of these clusters are time series that summarize the behavior of these ocean areas, and thus, represent potential OCIs. To evaluate cluster centroids for their usefulness as potential OCIs, we must determine which cluster centroids significantly influence the behavior of welldefined land areas. For this task, we use a variety of approaches that analyze the correlation between potential OCIs and the time series (e.g., of temperature or precipitation) which describe the behavior of land points. Based on these approaches, we have identified some cluster centroids that are almost identical to well-known OCIs, e.g., the Southern Oscillation Index (SOI) and the North Atlantic Oscillation (NAO). We also introduce two strategies for validating potential OCIs which do not correspond to well-known (and probably “stronger”) OCIs, namely, focusing on the correlation between “extreme” events on the ocean and land and looking for more persistent patterns of correlation.
منابع مشابه
Temporal Data Mining for the Discovery and Analysis of Ocean Climate Indices
* This work was partially supported by NASA grant # NCC 2 1231, NSF grant Number EIA 9818338, and by Army High Performance Computing Research Center cooperative agreement number DAAD19-01-2-0014. The content of this work does not necessarily reflect the position or policy of the government and no official endorsement should be inferred. Access to computing facilities was provided by the AHPCRC ...
متن کاملThe Application of Clustering to Earth Science Data: Progress and Challenges
The work described in this paper was conducted as part of the NASA funded project, Discovery of Changes from the Global Carbon Cycle and Climate System Using Data Mining, which was part of the Intelligent Systems (NRA2-37143) program. The goal of this project was to better understand global scale patterns in biosphere processes, especially relationships between the global carbon cycle and the c...
متن کاملDiscovery of Changes from the Global Carbon Cycle and Climate System Using Data Mining
The goal of our NASA sponsored project, “Discovery of Changes from the Global Carbon Cycle and Climate System Using Data Mining,” is to better understand global scale patterns in biosphere processes, especially relationships between the global carbon cycle and the climate system. To that end, we have developed data mining techniques to efficiently find spatio-temporal patterns in large Earth Sc...
متن کاملEfficient Rule Discovery in a Geo-spatial Decision Support System
This paper describes the application of data mining techniques in a Geo-spatial Decision Support System, which focuses on drought risk management. Association rule discovery is one of the widely used approaches in data mining. This paper highlights the rule discovery algorithms that we have developed and used for discovering useful patterns in ocean parameters and climatic indices to monitor dr...
متن کاملEfficient Rule Discovery in a National Drought Decision Support System∗
This paper describes the application of data mining techniques in a National Drought Decision Support System, which focuses on drought risk management. Association rule discovery is one of the widely used approaches in data mining. This paper highlights the rule discovery algorithms that we have developed and used for discovering useful patterns in ocean parameters and climatic indices.
متن کامل